DigOver: Towards Distributed & Collaborative Overlay Fault Diagnosis Based On User-level Belief Revision
نویسندگان
چکیده
Overlay networks have emerged as a powerful and flexible platform for developing new disruptive network applications. The attractive characteristics of overlay networks such as planetary-scale distributions, user-level flexibility (e.g. overlay routing) and manageability bring to overlay fault diagnosis new challenges, which include inaccessible underlying network information, incomplete and inaccurate network status observations; dynamic symptom-fault causality relationships, and multi-layer complexity. To address these challenges, we propose a distributed user-level Belief Revision based overlay fault diagnosis technique called DigOver. DigOver can passively use observed overlay symptoms as reported by overlay monitoring agents to correlate and diagnose faults, and select the least-costly appropriate probing actions whenever necessary to enhance the passive fault reasoning results. DigOver adapts to the changes in highly dynamic overlay networks by incrementally revising user beliefs based on new observed overlay symptoms. DigOver can diagnose faults without relying on underlying network fault probabilistic quantifications (e.g. prior fault probability). Simulations and experimental studies show that DigOver can efficiently (e.g. low latency) and accurately localize root causes of overlay faults/problems, even when the observed symptoms are incomplete.
منابع مشابه
Towards End-to-end Network Fault Diagnosis Based on User-level Observations
Overlay networks have emerged as a powerful and flexible platform for developing new disruptive network applications. The attractive characteristics of overlay networks such as planetary-scale distributions, user-level flexibility (e.g. overlay routing) and manageability bring to overlay fault diagnosis new challenges, which include inaccessible underlying network information, incomplete and in...
متن کاملOnline Fault Detection and Isolation Method Based on Belief Rule Base for Industrial Gas Turbines
Real time and accurate fault detection has attracted an increasing attention with a growing demand for higher operational efficiency and safety of industrial gas turbines as complex engineering systems. Current methods based on condition monitoring data have drawbacks in using both expert knowledge and quantitative information for detecting faults. On account of this reason, this paper proposes...
متن کاملAutomating the Librarian: Belief Revision as a Base for System Action and Communication with the User
This paper describes a current research project investigating belief revision in intelligent systems by modelling the librarian in interaction with a literature-seeking user. The work is designed both to test a theory of agent behaviour based on belief revision proposed by Galliers, and to evaluate a model of the librarian developed by Belkin, Brooks and Daniels, through computational implement...
متن کاملIntelligent Distributed Fault Management for Communication Networks
In this paper, we present an intelligent, distributed fault management system for communication networks using belief networks as fault model and inference engine. The managed network is divided into domains and for each domain, there is an intelligent agent called Domain Diagnostic Agent attached to it, which is responsible for this domain's fault management. Belief network models are embedded...
متن کاملAAAI-96 Workshop on Internet-Based Information Systems (Workshop Report)
hypermedia with cases indexed by neural net and with a payoff matrix model to determine optimal actions (including communications) for a decision-support system. The session on natural language featured talks on selecting dialog moves by using a subordinate instantiation of the system (global anticipation feedback) to predict user responses, dialog strategies for tutoring the user (modeled as a...
متن کامل